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INTRODUCTION 

One  of  the  hallmarks  of  tumor  suppressor  genes  is  that  both  copies  must  be 
inactivated  for  the  development  of  the  malignant  phenotype.  A  phenomenon  associated  with 
this  inactivation  is  that  a  single  recessive  mutation  is  exposed  following  loss  of  a 
chromosomal  region  containing  the  normal  copy  of  genes.  This  event  is  usually  manifested  in 
tumors  as  loss  of  heterozygosity  (LOH)  when  compared  to  normal  tissue  from  the  same 
individual.  Genome  screening  for  LOH  in  breast  cancer  has  identified  chromosome  region 
lp31  as  one  commonly  (50%)  involved  in  LOH  indicating  the  site  of  a  tumor  suppressor 
gene.  By  comparison  of  LOH  in  a  large  number  of  tumors,  the  minimal  region  has  been 
defined  by  two  markers  which  are  500-800  Kb  apart.  An  extended  region  of  loss  spans 
approximately  2  Mbp.  The  overall  aim  of  this  project  is  to  characterize  the  region  of 
chromosome  lp31  which  shows  consistent  loss  of  heterozygosity  in  breast  tumors  by 
constructing  a  BAC  contig  across  the  region,  searching  for  genes  in  this  contig  and  then 
performing  a  mutation  analysis  to  determine  which  ones  are  involved  in  tumori genesis.  One 
gene,  TTC4,  was  identified  in  this  region  and,  after  extensive  analysis,  could  not  be  shown  to 
be  involved  in  breast  cancer.  It  has  since  been  shown  that  this  gene  is  mutated  and  so 
potentially  involved  in  the  development  of  some  melanomas.  Ip31  is  also  frequently  involved 
in  LOH  in  melanomas.  To  identify  more  genes  in  the  critical  lp31  region  involved  in  breast 
cancer,  we  extended  our  search  for  candidates  using  a  variety  of  approaches  but,  in  particular, 
using  the  latest  releases  of  sequence  form  the  Human  Genome  Project. 

BODY 


In  our  previous  report  we  described  the  construction  of  a  YAC  contig  across  the 
minimally  deleted  region  which  extended  approximately  1500  Kb.  In  the  early  part  of  this 
year  we  began  a  series  of  hybrid  cDNA  capture  experiments  to  isolate  genes  from  a  breast 
cancer  cDNA  library.  This  approach  depends  on  hybridizing  cDNA  clones  from  a  total 
library  to  YAC  DNA  and  then,  through  successive  rounds  of  amplification  and  selection,  a 
number  of  clones  are  recovered  for  testing  which  apparently  have  homology  to  the  target 
YAC  clone.  Because  of  the  presence  of  repetitive  sequences  in  many  cDNA  clones,  however, 
the  risk  of  isolating  false  positives  is  very  high  using  this  approach,  especially  with  YACs 
since  they  are,  on  average,  1  Mbp  long.  After  a  series  of  experiments,  we  isolated  a  number 
of  candidate  cDNAs  but  were  unfortunately  not  able  to  map  any  of  the  clones  back  to  the 
YAC  contig  used  in  the  screen.  At  this  point,  data  from  the  Whitehead  Institute  describing 
new  releases  of  the  human  gene  map  for  chromosome  1 ,  although  not  identifying  any  full- 
length  genes  in  the  critical  lp31  region,  did  map  the  progesterone  receptor  gene  (PTGFR)  to 
the  distal  border  of  this  region.  Because  of  the  potential  involvement  of  this  receptor  in  breast 
cancer  development,  we  decided  to  submit  this  gene  to  an  analysis  by  SSCP/mutation 
screening  and  DNA  sequencing.  In  order  to  establish  the  exon/intron  structure,  we  identified 
a  BAC  clone  which  contained  the  PTGFR  gene  and  then,  by  aligning  the  cDNA  sequence 
with  the  genomic  sequence,  were  easily  able  to  establish  the  exon  structure.  The  PR  gene 
only  contains  three  exons  and  so  primers  were  designed  (Table  1)  to  amplify  the  individual 
exons  and  subject  them  to  SSCP  analysis.  This  approach  identifies  single  strand  PCR 
products  which  show  altered  mobility  on  acrylamide  gels,  indicative  of  a  DNA  sequence 
change  (either  mutation  or  polymorphism).  Since  each  of  the  PTGFR  exons  were  large  (>250 
bp),  we  designed  a  series  of  overlapping  PCR  products  to  cover  each  exon  (Table  1).  We 
used  32  tumor  samples  from  breast  cancers  in  this  analysis  and  identified  a  number  of  band 
changes.  When  sequenced,  these  changes  proved  to  be  silent  polymorphic  variants  resulting 
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in  silent  changes  in  the  amino  acid  code  for  the  gene  (Table  2).  From  this  survey,  therefore, 
we  concluded  that  PR  gene  is  not  frequently  mutated  in  breast  cancer. 

At  this  point  the  accelerated  pace  of  the  human  genome  sequencing  project  dictated 
an  alternative  approach,  which  was  to  build  up  BAC  contigs  from  sequence  databases  across 
the  lp31  region  and  then  use  database  searching  to  identify  ESTs  and  genes.  Using  the 
available  physical  map  which  was  used  to  created  the  YAC  contig,  we  searched  the  high- 
throughput  genome  sequencing  database  for  BACs  across  the  region.  These  are  shown  in 
Figure  2.  These  individual  BACs  were  identified  and  then  overlapping  BACs  from  the  same 
databases  were  identified  by  BLAST  searches.  The  sequencing  effort  is  being  undertaken  by 
the  Sanger  Center  in  the  UK  and  the  genome  center  at  Washington  University.  Since  there  is 
considerable  redundancy  in  these  two  efforts,  it  was  possible  to  build  up  a  contig  of  BACs 
with  only  four  gaps,  the  lengths  of  which  cannot  be  determined  at  this  stage.  Of  the  2-3  Mbp 
region,  approximately  50%  has  been  sequenced;  thus,  there  is  adequate  sequence  to  begin  to 
build  up  a  transcription  map  of  the  region  to  search  for  candidate  genes. 

Using  BLAST  searches  and  the  Genome  Channel  program  (GenScan)  at  Oak  Ridge 
National  Laboratories,  we  were  able  to  identify  ESTs  from  the  region.  These  are  summarized 
in  Table  3.  Firstly,  It  has  been  an  important  observation  that  there  are  few  genes  for  which  the 
full-length  sequence  to  map  this  region  is  known.  A  number  of  ESTs  were  identified,  but 
most  of  these  cluster  at  the  proximal  end  of  the  contig  and  include  the  latrophilin  gene  which 
has  previously  been  shown  not  to  be  mutated  in  breast  cancer  cells.  The  only  other  gene  in 
this  region  is  the  DR1L2P  gene  which  is  a  pseudogene  for  the  DREL  transcription  factor.  The 
distal  60%  of  this  contig  apparently  contains  no  ESTs,  either  through  BLAST  searches  or 
through  analysis  of  the  sequence  from  the  individual  BACs  using  GenScan.  Although  a 
number  of  individual  exons  are  predicted  in  this  region  using  GRAIL,  no  extensive  gene 
structures  could  be  identified.  Even  though  these  gene  prediction  tools  are  notoriously 
inaccurate  in  their  ability  to  identify  genes,  it  is  still  true  that  the  region  is  very  gene  poor 
which  has  the  advantage  that  any  gene  which  is  found  has  a  higher  probability  of  being  the 
one  reduced  to  homozygosity  in  breast  cancers. 

Within  the  proximal  clusters  of  ESTs,  a  number  show  homologies  to  Unigene  clusters 
which  provide  for  more  cDNA  sequence  from  these  genes.  Despite  this,  the  available 
sequence  still  does  not  provide  any  indication  of  the  potential  function  of  any  of  these  genes. 
We  are  now  concentrating  on  using  the  available  sequence  information  to  screen  cDNA 
libraries  to  identify  more  full-length  clones. 

KEY  RESEARCH  ACCOMPLISHMENTS 

-  Construction  of  a  BAC  contig  across  the  LOH  region  in  lp31. 

-  Analysis  of  approximately  1700  kb  of  sequence  using  GenScan  to  predict  genes  in  the 

critical  region. 

-  Identification  of  ESTs  and  Unigene  clusters  in  the  lp31  region 

-  Exclusion  of  the  progesterone  receptor  gene  in  the  development  of  breast  cancer 

REPORTABLE  OUTCOMES 

None 

CONCLUSIONS 

As  a  result  of  advances  in  the  development  of  the  human  genome  sequencing  project,  we 
have  abandoned  traditional  gene  cloning  strategies  and  have  concentrated  on  building  up  the 
contig  of  complete  sequence  data  for  the  critical  BRCA  region  in  lp31.  This  analysis 
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identifies  approximately  1.7  Mbp  of  the  region  with  only  4  gaps.  An  analysis  of  this  data 
places  the  progesterone  receptor  at  the  outer  limit  of  the  contig  and  mutation  analysis  of  this 
gene  does  not  support  its  role  in  the  consequences  of  LOH  .  We  have  identified  27  ESTs  to 
date  in  the  critical  region  and  five  Unigene  clusters.  None  of  the  data  available  for  genes  in 
this  region  allows  for  a  prediction  of  their  function.  The  construction  of  a  rudimentary 
transcription  map  for  the  lp31  region  involved  in  breast  cancer  now  means  that  the  isolation 
of  full  length  genes  will  be  possible  by  cDNA  library  screening.  As  the  sequence  data 
become  more  complete,  the  generation  of  a  full  transcription  map  will  become  possible. 

REFERENCES 

None 

APPENDICES 

None 
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Figure  1 :  Summary  of  the  BAC  map  of  the  BRCA  1  p31  region  relative  to  the  physical  map. 
The  location  of  ESTs  identified  by  BLAST  searches  are  shown. 
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Table  2:  Summary  of  the  silent  polymorphisms  found  in  the  PTGFR  gene  in  breast 
cancers.  The  2  polymorphisms  with  a  minus  (-)  annotation  occur  within  the  intron  for 
exon  2. 


Exon 

2 


nucleotide  amino  acids 


C63T  T21T 

-152  T  ->  C 
-62  T ->  C 

G435A  T145T 
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BAC  clone 

Acc# 

EST  Acc# 

UniGene 

Cluster 

Gene  name 

RP3-445O10 

AL158173 

AA909664 

AA827584 

Hs.  132428 

RP5-837I24 

AL138799 

R 10940 

AI147471 

AA406063 

BE894577 

BE279803,  BE256076 
AI267423 

Hs.  146741 
Hs.98003 

DREL2P 

RP4-653E17 

AL138841 

AI732798,  AA487326 
AW105209 

Hs. 188839 
Hs. 242795 

RP11-100D1 

AL358939 

AW270271,  AI243907 
AA347902 

W03266 

AA347901 

N69441 

Hs.  149086 

RP11-211B4 

AL359705 

AI005162 

AA369954 

R01807 

AA347901 

AA259171 

BE066265 

T06662 

H63565 

H63611 

BE709816 

AL040055 

Hs.24212 

LATROPEDOLIN 

Table  3:  List  of  EST  markers  and  their  respective  UniGene  clusters  that  map  to  BAC  clones 
from  the  breast  cancer  critical  region  in  lp32. 
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